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« VARIABLE BITRATE VIDEO CODING METHOD AND CO I >ING VIDEO 

CODER » 



The present invention relates to a variable bitrate video coding method and to a 
corresponding video coder allowing to carry out said method. 

As described in the document « MPEG Video coding : a basic tutorial introduc- 
tion », S.R. Ely, BBC Research and Development Report, BBC-RD- 1996/3, pp. 1-10. MPEG 
5 activities started in 1988 with the aim of defining standards for digital compression of video and 
audio signals. The first goal was to define a video algorithm for digital storage media such as 
the CD-ROM (Compact Disc Read-Only Memory), but the resulting standard was also applied 
in the Interactive CD system (CD-I), Allowing transmission and storage of picture data at bit 
rates in the range of i to 15 Mbits/s, this standard is based on a data compression achieved by 
10 using a block-based motion compensation for the reduction of the temporal redundancy and a 
discrete cosine transformation (DCT) for the reduction of the spatial redundancy. 

With conventional CD standards such as CD-I and CD-ROM, the transmission 
bit rate is fixed and pictures can therefore only be coded at a constant bit rate. New standards 
such as the Digital Versatile Disc (DVD) allow for transmission of data at a variable bit rate 
15 (VBR) ; complex scenes can be coded at a higher bitrate than scenes containing less informa- 
tion, in order to maintain a constant quality. 

An object of the invention is to propose a VBR video coding method allowing to 
obtain such a constant quality of the coded sequence, but under the constraint of a given total 
bit. budget (i.e. according to a predetermined target bitrate). 
20 To this end the invention relates to a VBR video coding method characterized in 

that, it comprises : 

(a) an analysts step, for coding a picture sequence with a constant quantization step and 
generating a bitstream for which the average bit rate of the sequence does not fulfil necessarily 
a required constraint of a specific size ; 
25 (b) a prediction step, for predicting the quantization stepsize which must be used to code the 
bitstream according to a predetermined target bit rate ; 

(c) at least one repetition of said analysis and prediction steps in order to get a more precise 
estimation of said stepsize ; 
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(d) a final control step, reiving on a comparison of the cumulative bit rate for alt previous 
frames with the cumulative predicted bit rate, in view of an adjustment of the stepsize so that 
said target bit rate is matched. 

Another object of the invention is to propose a VBR video coder provided for 
5 implementing said coding method. 

To this end the invention relates to a variable bit rate video coder characterized 
in that it comprises : 

(a) a coding branch, including in series at ieast a quantization circuit and a variable length 
circuit : 

30 (b) a prediction branch, including in series at the output of said quantization 

device at least an inverse quantization device and a prediction sub-system and defining by 

subtraction the signal to be coded by said coding branch ; 

(c) a control circuit provided for carrying out the implementation of the 

following operations : 
15 - a coding operation of a video sequence with a constant quantization stepsize ; 

-a prediction operation for an estimation of a quantization stepsize allowing to code the 

bitstream with a specified target bitrate ; 

- at least one repetition of said operations ; 

- a final controlling operation in order to match the total number of bits spent during the coding 
20 operation to said target bitrate. 

The advantages of the invention will now be shown in relation with the following 
description and the accompanying drawings, in which : 

- Fig.l shows some pictures of a group of pictures, and Figs 2 and 3 illustrate for such a typical 
group of pictures the difference between the display order of the pictures and their transmission 

25 order ; 

- Figs 4 and 5 show respectively the main steps of the coding method according to the 
invention and a corresponding architecture of video coder allowing to implement, said method ; 

- Fig. 6 shows a control loop used for the implementation of the final control step of said 
coding method. 

30 Before describing the coding method according to the invention, some basic 

principles of the MPEG-2 video standard may be recalled. The flexibility of this standard, 
intended to support a wide range of possible applications, is obtained thanks to the definition of 
>rotl i ■ „vir)g to suit all of the application requirements. A profile is a subset of 
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the MPEG-2 standard intended to support only the features needed by a given class of 
application, while a level defines a set of imposed constraints on parameters of the bitstream. 

The basic steps of the MPEG- 2 compression, applied to colour pictures 
consisting of three components (Y,U,V), concern frames that are divided into small subsec- 
5 tions, or macroblocks, themselves consisting of luminance and chrominance blocks. These steps 
are : motion estimation and compensation (based on 16 pixels x 16 lines macroblocks), discrete 
cosine transformation (based on 8 pixels x 8 lines blocks), run-length coding. 

Three types of frames, or pictures, are denned. Intra pictures (or I pictures) are 
coded without reference to other pictures, predictive pictures (or P pictures) are coded using a 
10 motion-compensated prediction from a past i or P picture, and bidirectional-predictive pictures 
(or B pictures) use both past and future I or P pictures for motion compensation. The motion 
information is given in the form of motion vectors obtained by implementation of a block- 
matching search (in which a large number of trial offsets are tested in the coder and the best 
one is selected on the basis of a measurement of the minimum error between the block being 
15 coded and the prediction). 

As indicated in Fig. 1 that illustrates how, on the basis of the motion vectors, the 
P and B pictures are denned, the different pictures typically occur in a repeating sequence 
which is termed a group of pictures., or GOP, and consists of an 1 picture and all succeeding 
pictures until the next I picture occurs. A typical GOP is shown in display order in Fig. 2 (the 
20 black arrows correspond to forward predictions and the white ones to backward predictions, 

! K n i f. \ ! is it I 1 1 I i 1 ! 1 >. 3 > 

P7, 110, P13 designate the re-ordered frames), said orders being different to enable at the 
decoding side backward predictions from future pictures. 

A regular GOP structure can be described with two parameters, N and M. The 
25 parameter N, defined as the size of a GOP, is, as shown in Fig.2, the number of pictures of said 
GOP, i.e. the number of pictures between two I pictures plus one. The parameter M is the 
spacing of P pictures, or (which is the same) the number of adjacent B pictures plus one. In the 
illustrated example of Figs 1 to 3, M = 3 and N = 9. Obviously other combinations are 
possible : 

30 Picture display order N M 

IPPPPPIPP 6 1 

IBPBPBPBI 8 2 
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4 , : 

IBBPBBPBBPBBIBBP 32 3 

N and M being chosen independently from each other. 

After a decision about the kind of macroblock compensation has been taken, the 
prediction error for each pixei of the concerned macroblock is obtained by subtracting the 
5 estimated macroblock from the original one. A DCT is then performed on the prediction error, 
for a biock of 8 x 8 pixels (hence six DCT transforms are determined for each macroblock : 
four for the luminance component, two for the chrominance components), and the frequency- 
components thus obtained are quantized The quantization stepsize determines the bitrate and 
the distorsion of the decoded image ; if the quantization is coarse, few bits are needed to code a 
10 picture, but the final quality is low, while, if the quantization stepsize is fine, many bits are 
needed to code the picture, but the quality is high. As the human eye is less sensible to the 
higher frequencies than to the lower ones, it is advantageous to use coarser quantizers for the 
high frequency components (in fact, in order to achieve the frequency dependent quantization, a 
weighting matrix is applied to a basic macroblock quantization parameter : a lot of coefficients, 
15 especially those at high frequencies, is equal to 0 after said weighted quantization). 

Each block is then zigzag scanned and the obtained list is coded. The run-length 
coding is done by determining a pair (A, NZ) where A designates the number of consecutive 
zeros (0 to 63) and NZ the amplitude of the following non-zero coefficient. 
A variable length code is then assigned to this pair (A,NZ), depending on the frequency of 
20 occurrence of this pair (a combination (A,NZ) which is common is assigned a short variable 
length code, whereas a pair which is less frequent is assigned a long one). 

The functions hereabove described are summarized for example in the document 
« Hybrid extended MPEG video coding algorithm for general video applications », C.T. Chen 
and al., Signal Processing : Image Communication 5 (1993), pp, 21-37, part 2.4, which includes 
25 the scheme of a generalized MPEG-2 coder. The aim of the VBR coding method now proposed 
is to guarantee that the size of the final bitstream will fit exactly on a storage medium with a 
fixed capacity like a DVD. A diagram of said method is given in Fig.4, and an architecture of 
video coder allowing to implement said method is illustrated in Fig. 5. 

The coding method is divided into three steps 41, 42 and 43, The first step 41 is 
30 an analysis one, in which a picture sequence is coded with a constant quantization stepsize Q c 
(and therefore with a constant quality). At the end of this step, a regular MPEG-2 compliant bit 
stream has be but the average bit rate of the i - ;e thus processed {i.e. 

the quotient of the total number of coded bits over the sequence by the total number of pictures 
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in that sequence), unknown before the end of said step, does not fulfil the required constraint of 
a specific size of the bit stream. 

The second step 42 is a prediction one, intended to predict the quantization 
stepsize Q which must be used to code the bistream according to the specified target bit rate. 
5 Once said prediction step is completed, the analysis step 4! may be repeated farrow in Fig. 4) as 
often as necessary in order to get a more precise estimation for Q (however, a good prediction 
is generally obtained after a few runs, for instance two). 

As the quantization stepsize Q available at the end of this second step is only an 
estimated value, the total bit budget is not exactly matched if every picture is coded by using 

10 said predicted value. The last step is a final pass 43 that allows to guarantee that the constraint 
on the total average bit rate is strictly observed To ensure that the final output bitstream has 
indeed exactly the desired size, a quantization stepsize control process is implemented. This 
process is based on a control loop relying on a comparison of predicted and teal bit rates. After 
the coding of each picture in the final step, the control process compares the total number of 

15 bits that have been spent with the allowed one. If more bits have been spent than the budget 
allows, the quantization stepsize is increased, and the bit rate of the following pictures is 
reduced. If fewer bits have been spent than the budget allows, Q is decreased and the bit rate is 
increased, the total target bit rate b<*ir_: 1 „ i 

Said VBR coding method is implemented in the coder of Fig. 5, where each 

20 block corresponds to a particular function that is performed under the supervision of a 

controller 55. The illustrated coder comprises in series an input buffer 51, a subtracter 549, a 
DCT circuit 521, a quantization circuit 522, a variable length coding circuit 523, and an output 
buffer 524. The circuits 521 to 524 constitute the main elements of a coding branch 52, to 
which a prediction branch 53, including an inverse quantization circuit 53 1 , an inverse DCT 

25 circuit 532 and a prediction sub-system, is associated. This prediction sub-system itself 

comprises an adder 541, a buffer 542, a motion estimation circuit 543 (said estimation is based 
on an analysis of the input signals available at the output of the buffer 51), a motion compensa- 
tion circuit 544 (the output signals of which are sent backwards to the second input of the 
adder 541), and the subtracter 549 (receiving both output signals of the buffer 51 and output 

30 signals of the motion compensation circuit 544, for sending their difference towards the coding 
branch). 

The output of the illustrated coder is sent towards the controller 55 that includes 
the control loop provided to carry out the final step 43. The main elements of said control loop 
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6 : 
for the final pass of the VBR coder are shown in Fig. 6. As already explained, it is necessary to 
adjust the quantization stepsize during this final coding pass, in order to ensure that the total 
target bit rate given by the operator is exactly matched. Said loop first comprises a first 
computation circuit 61 in which the output of the loop (i.e. the cumulative prediction error) is 
5 multiplied by a factor KP. This factor is itself equal to a constant QC (chosen by the operator) 
multiplied by a weighting factor Q^/APG, where Q m , is an integrative estimation of Q and APG 
the total number of bits for a GOP (of N frames). 

An adder 63 then adds the output Q.prop of said circuit 61 and the signal Q in! , 
available at the output of a second computation circuit 62 provided for yielding an integrative 

10 estimation of Q. A conversion circuit 64 gives the cumulative bitrate for all preceding frames, 
on the basis of a relation R - f,(Q) (between the quantization factor Q at the output of the 
adder 63 and R) stored in said circuit 64. The cumulative bitrare thus obtained is compared in a 
comparator 65 with the cumulative predicted bitrate available on a second input of said 
comparator and is used, after an integration in a circuit 66, in order to modify Q accordingly, 

15 The VBR coding strategy as presented above is an improvement with respect to 

previous VBR coders because it achieves a better equalisation of the perceptive quality of the 
decoded sequence. The classical VBR coders adjust, the quantization parameter Q while coding 
a frame, so that the predicted bit rate is matched for every picture. Hence they ailow the 
quantization parameter Q to vary inside a frame, and no constant spatial quality of the picture 

20 can be achieved. This variation in quality occurs whether the bit rate of the frame is correctly 
predicted or not. For the proposed VBR coder, Q is kept constant over a frame and the spatial 
quality of any picture in the video sequence does not vary. If the frame bit rates and the 
quantization stepwidth are correctly estimated, the Q before adaptive quantization, and 
therewith the subjective distortion, remains exactly constant for all macroblocks of the 

25 sequence. Since the quantization stepwidth and the frame bit rates are only estimated, a 

variation of Q, and hence of the quality of the sequence, occurs from frame, to frame, but, after 
two analysis passes, the deviations of Q, averaged over a frame., are generally below 1 %. 

Apart from the attainment of a constant intra-frame quality, several other 
important aspects of the new VBR strategy may be mentioned : 

30 - it is possible to improve the prediction of the quantization factor in an iterative way by 
increasing the number of analysis passes : if after the analysis run, the deviation from the 
wanted target bit rate is still too high, a better estimation for the quantization factor can be 
calculated using the results from the previous coding passes ; 
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- as the new VBR coding strategy predicts Q, analysis passes that are performed with another 
frame order than the predicted pass can be exploited : this is impossible using old strategies, 
and this is a major advantage of the new coding concept ; 

- if, in the final pass, the variations of Q and consequently of the quality turn out to be 

5 unacceptably high, the final step can be used as an analysis pass for the prediction of Q and of 
the bitrate for the subsequent pass : using this feature, it is possible to develop a coder that 
performs as many coding passes as needed until the characteristics of the output bitstream are 
within certain limits defined by the operator ; 

- as the control loop has an integrative character, short-term bit rate prediction errors cancel 
10 each other out : therefore, systematic, frame-type dependent prediction errors do not seriously 

affect the performance of the proposed VBR coder. 
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1 . A variable bitrate video coding method characterized in that it comprises : 
(a) an analysis step, for coding a picture sequence with a constant quantization 

stepsize and generating a bit stream for which the average bit rate of the sequence does not fulfil 
necessarily a required constraint of a specific size ; 
5 (b) a prediction step, for predicting the quantization stepsize which must be used to 

code the bitstream according to a predetermined target bit rate ; 

(c) at least one repetition of said analysis and prediction steps in order to get a 
more precise estimation of said stepsize ; 

(d) a final control step, relying on a comparison of the cumulative bit rate for all 
10 previous frames with the cumulative predicted bit rate, in view of an adjustment of the stepsize 

so that said target bit rate is matched. 

2. A variable bit rate video coder characterized in that it comprises : 

(a) a coding branch, including in series at least a quantization circuit and a variable length 
circuit ; 

15 (b) a prediction branch, including in series at the output of said quantization 

device at least an inverse quantization device and a prediction sub-system and defining by 
subtraction the signal to be coded by said coding branch ; 

(c) a control circuit provided for carrying out the implementation of the 
following operations : 

20 - a coding operation of a video sequence with a constant quantization stepsize ; 

- a prediction operation for an estimation of a quantization stepsize allowing to code the 
bitstream with a specified target bitrate ; 

- at least one repeti tion of said operations ; 

- a final controlling operation in order to match the total number of bits spent during the coding 
25 operation to said target bitrate. 

3. A variable bitrate video coding device according to ciairn 2, characterized in that 
said controlling operation is carried out by means of a control loop thai modifies said stepsize 
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according to the result of a comparison between i u i - s for all previous frame* 
and the cumulative predicted bitrate. 
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